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ABSTRACT 

We present a parametric strong lensing model of the cluster Abell 2218 based on HST ACS data. 
We constrain the lens model using 8 previously known multiply imaged systems, of which 7 have 
spectroscopically confirmed redshifts. In addition, we propose five candidate multiply imaged systems 
and estimate their redshifts using our mass model. The model parameters are optimized in the source 
plane by a bayesian Monte Carlo Markov Chain as implemented in the the publicly available software 
Lenstool. We find rms^ = 0'.'12 for the scatter of the sources in the source plane, which translates into 
rmSi — 1'.'49 between the predicted and measured image positions in the image plane. We find that 
the projected mass distribution of Abell 2218 is bimodal, which is supported by an analysis of the light 
distribution. We also find evidence for two structures in velocity space, separated by ~ 1000 km s^^, 
corresponding to the two large scale dark matter clumps. We find that the lensing constraints can 
not be well reproduced using only dark matter halos associated with the cluster galaxies, but that 
the dark matter is required to be smoothly distributed in large scale halos. At 100" (291 kpc) the 
enclosed projected mass is 3.8 x 10^^ Mq. At that radius, the large scale halos contribute ~ 85% 
of the mass, the brightest central galaxy (BCG) contributes ^ 9% while the remaining ~ 6% come 
from the other cluster galaxies. We find that the model is not very sensitive to the fainter (and 
therefore by assumption less massive) galaxy sized halos, unless they locally perturb a given multiply 
imaged system. Therefore, dark galaxy sized substructure can be reliably constrained only if it locally 
perturbs one of the systems. The massive BCG and galaxies which locally perturb a multiply imaged 
system are reliably detected in the mass reconstruction. In an appendix we give a self-contained 
description of the parametric profile we use, the dual pseudo isothermal elliptical mass distribution 
(dPIE). This profile is a two component pseudo isothermal mass distribution (PIEMD) with both a 
core radii and a scale radii. 

Subject headings: dark matter — galaxies: clusters: individual (Abell 2218) — gravitational lensing 



1. INTRODUCTION 

Dark matter dominates over baryonic matter in the 
universe, but its nature is not known. The study of 
the inner parts of dark matter halos can give insight 
into the nature of the dark matter, as the steepness 
of the profile is correlated with the interaction be- 
tween the dark matter itself and with the baryonic 
matter. According to ACDM simulations, the mass 
distribution of galaxy clusters should be dominated by 
their dark matter halos. Gravitational lensing, which 
is sensitive to the total matter distribution, visible 
or dark, is ideal for studying the mass distribution 
of clusters. Strong lensing features, consisting of 
multiply imaged and strongly distorted background 
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sources, provide constraints on the inner parts of the 
cluster, while weak lensing features, consisting of weakly 
distorted singly imaged background sources, provide 
constraint s on the outer slope of the s urface mass profile 
(see e.g.. ISmail fc Dickinson ! 1 19951; ISmail et all 11995]: 
Kneib et al.l 119961: ISmail et a l. 1997; Abdelsalam ct aQ 
1998"; 'Nataraian et al.' '2002b; Bradac ct aL| 12001 : 
Gavazzi & Soucail 2007; Limousin ct al. 2007b|). 

Lensing can therefore provide unique information 
about the total mass distribution of clusters, from the 
inner to the outer parts. In addition, lensing can in prin- 
ciple be used to deduce various cosmological parameters 
(e.g. i?0: ^A: ^m)- This has already bee n extensively ap- 
phed to lensing on galaxy sca les (see e.g.. lSchechter et aLl 
Il997t iKoopmans et al.l[2003f ) and to a smaller degree for 
lensing on cluster sca les (see e.g., ISoucail et al.l 120041 : 
iMeneghetti et alll2005f ). where the accuracy of the mass 
map is a limiting factor. The accuracy of the mass map 
is strongly dependent on the number of multiply imaged 
systems used to constrain it. Therefore, to construct 
a robust model of the dark matter distribution, accu- 
rate enough for cosmography and for using the cluster 
as a gravitational telescope, it is important to include as 
many spectroscopically c onfirmed multipl y imaged sys- 
tems as possible (see e.g.. lEllis et al.l[200ll ). 

Abell 2218 is one of t he richest clusters in the Abel l 
galaxy cluster catalog (jAbelll 11958t lAbell et al.l 11989f) 
and has been successfully exploited as a gravitational 
lens. A parametric lens model has previously been 
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constructed by iKneib et al.1 (|1995l |1996[ ) (using 1 and 
2 s pectroscopically confirme d sys tems respectively) and 
by iNataraian et al.l (|2002bl . |2007[ ) (using 4 and 5 spec- 
tr oscopically confirme d systems) building on the model 
of iKneib et al.l (|1996l ) and including weak lensing con- 
straints from HST WFP C2 data. A non-pararn etric 
model was constructed by lAbdelsalam et al.l ()1998[ ) us- 
ing three spectroscopically confirmed multiply imaged 
systems. In all of these models, a bimodal mass dis- 
tribution was required to explain the image configura- 
tions (i.e. the models include two large scale dark matter 
clumps), but the number of constraints were not suffi- 
cient to accurately constrain the second large scale dark 
matter clump. Abell 2218 h as also been u sed as a grav- 
itational telescope, with Ellis ct al. (2001') discovering a 
source at z = 5.6 and Kncib et al. (2004^ discovering 
an even more d i stant source at z ~ 6.7, la ter confirmed 
bvlEgami et al.l (|2005D using Spitzer data. ISoucail et al.l 
estimated cosmological parameters based on a 
lensing model of Abell 2218 using 4 multiply imaged sys- 
te ms. The late s t pub lished lens model of Abell 2218 is 
bv ISmith et al.l (|2005D . who incorporated four multiply 
imaged systems and weak lensing constraints, using the 
WFPC2 data. Although the number of constraints has 
increased from the initial models, all previous models 
have assumed that the location of the second dark matter 
clump coincided with the brightest galaxy in the South 
East, due to a lack of constraints in its vicinity. 

The motivation for revisiting the modeling of Abell 
2218 comes from the new ACS data which have not been 
used before for modeling this cluster and are superior in 
both resolution, sensitivity and field of view to the previ- 
ous WFPC2 data set. These new high quality data allow 
us to identify several subcomponents in previously known 
multiple images, thus adding more constraints, and in 
one case, two more multiple images of a known system. 
In addition, we have measured a spectroscopic redshift 
for an arc around the second dark matter clump, which 
our model predicts to be singly imaged. We also have sev- 
eral new candidate multiply imaged systems, which we 
add as constraints and estimate their redshift with the 
model. In total we have 7 multiply imaged systems with 
measured spectroscopic redshift and 6 multiply imaged 
systems without spectroscopic data (of which 5 are new 
candidate systems). Finally, the lensing code Lenstool, 
has undergone significant improvements from previous 
models, with the incorporation of a Monte Carlo Markov 
Chain (MCMC), which enables us to not only find the 
best model in the lowest sense, b ut the most likel y 
model as measured by its Evidence (| Julio et al.ll2007f ). 
The MCMC also allows for a reliable estimate of the un- 
certainties in the derived model parameters. 

The paper is organized as follows: In Section [2l we 
give an overview of the data used in this paper. We 
compile a list of all currently known and new multiply 
imaged systems in Abell 2218 and discuss the reliability 
of the redshift estimate of each one in Section [3l The 
methodology of the strong lensing modeling is described 
in Section |31 In Section [S] we present the results of our 
lensing analysis, and compare them to previous models. 
In Section [6] we discuss degeneracies in the modeling. In 
Section [7] we address how reliable our model is, and dis- 
cuss the smoothness of the dark matter distribution. In 
Section[8]we interpret the bimodality of our model, along 
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Fig. 1. — Color-magnitude diagrams, ( V-Z) and (I-Z), used for 
selecting the cluster members (see Section l2.ll . The red quadrilat- 
erals define the red sequence and galaxies lying within are consid- 
ered cluster members. 

with X-ray measurements and an analysis of the distri- 
bution of cluster members in velocity space. We sum- 
marize our main conclusions in Section [9l Throughout 
the paper, we adopt a fiat A-dominated Universe with 
»A = 0.7,0^ = 0.3 an d = 70 kms"^ Mpc"^ Follow- 
ing [SmitF^eOn (|2005D we place the cluster at z = 0.171. 
At this redshift 1" corresponds to 2.91 kpc in the given 
cosmology. 

2. DATA 

We use data from several different sources for our lens 
modeling and analysis. The basis of our modeling is Ad- 
vanced Camera for Surveys (ACS) data from the Hubble 
Space Telescope, which allows us to identify and accu- 
rately locate multiply imaged systems. Our cluster mem- 
ber catalog is also selected using the ACS data and the 
magnitude of each cluster member is given in the Ingrid 
K-band of the William Herschel Telescope. In addition, 
we have obtained a spectroscopic redshift of a system us- 
ing the Keck telescope and archival Chandra X-ray Ob- 
servatory data have been used to produce an X-ray map 
of Abell 2218. 

2.1. The galaxy catalog 

Cluster members were selected based on the ACS data 
using the characteristic cluster red-sequences (V-Z) and 
(I-Z) in two color-magnitude diagrams and selecting ob- 
jects lying within the red-sequences of both diagrams 
(see Figure [1]) . The ACS data were used to select the 
galaxy members due to their better photometric accu- 
racy compared to the K-band data. Nevertheless, the 
catalog is given in the K band since it is more representa- 
tive of the galaxy population in elliptical galaxies. This 
selection gave 203 cluster galaxies down to K = 19.6. 
However, we rejected six galaxies from the catalog leav- 
ing 197 cluster members, of which four have measured 
redshifts showing them to be background galaxies. 

2.2. X-ray data 

The X-ray data were taken with the Advanced CCD 
Imaging Spectrometer (ACIS) on NASA's Chandra X- 
ray Observatory. The ACIS is an array of 10 CCD's, 
capable of simultaneously imaging and measuring the en- 
ergy of the incoming X-rays. We combined three images 
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Fig. 2.— A color image of Abell 2218 based on ACS data (F775W, F625W and F475W filters in the red, green and blue channel 
respectively). The cluster galaxies are marked in yellow (modeled using scaling relations) or blue (individually fitted). The multiple 
images are labeled in green for spectroscopically confirmed systems and red for candidate systems. The arc for which we have obtained 
spectroscopic redshift, S8, is labelled in cyan. Also shown arc the critical lines corresponding to z = 0.702 (cyan), z = 2.515 (red) and 
2 = 6.7 (green). 



taken at different epochs to construct our final X-ray map 
of AbeU 2218. Two of the images were taken in October 
1999 and one in August 2001, with 5960 s, 11560 s and 
49240 s exposure times respectively. The data reduction 
was performed using standard pipelines from the Chan- 
dra Interactive Analysis of Observations (CIAO) soft- 
ware. The X-ray data will be used for comparison with 
the lensing data in Section [S] 

3. MULTIPLY IMAGED SYSTEMS 

Multiply imaged systems are the basis of our analysis, 
and are particularly useful as constraints if their redshifts 
are accurately determined. In this section we list the 
multiple images, with or without spectroscopically con- 
firmed redshifts, used as constraints in our lens modeling. 
We propose several new candidate systems found in the 
ACS data and present a spectroscopic redshift measure- 
ment for an arc which lies in the region of the second 
dark matter clump. All systems are listed in Table[T]and 
shown on Figure [21 

3.1. Previously known systems 

The first syste m, system SI, is a t a red shift oiz — 0.702 
as measured by iLe Borgne et al.l (|1992D (system 359 in 
their catalog). Thanks to the new ACS observations, it 
has seven ide ntified images (pre viously it had five iden- 
tified images (Knei b et al.lll99 6l)), of which two pairs are 
located close to individual galaxies #1028 and #993 in 
the galaxy catalog) which affect the lensing signal. 



System S 2 is a star-forming galaxy at a redshift of 
z = 2.515 fEbbel s et aI1ll996t) . It consists of a fold 
a nd a counter image (listed as images #384 and #468 
in lEbbels et al.l ()1996[ )). We separate the fold image into 
two images and identify four components, giving us four 
sets of triple images from this system (referred to as S2.1, 
S2.2, S2.3 and S2.4). 

System S3 is a faint star-forming doubly ima ged system 
at a h igh redshift z = 5.576 discovered by lEUis et all 
(|200ll) . 

System S4 is a tripl y imaged submillime ter source at 
z = 2.515 reported bv lKneib et al.l (|2004bD . In each im- 
age they identify three components labelled a, 7, f3. We 
identify all three images in the ACS data. However, we 
only detect the a and (3 components. We include both as 
constraints on the model (referring to them as S4.1 and 
S4.2 respectively). Image S4.1 seems to consist of further 
three subcomponents. As we do not clearly distinguish 
them in all three images, we do not add them as separate 
constraints. 

The fifth system, S5, is a triply imaged system, be- 
lieved to be the triply imaged outskirts of an associ- 
ated singly imaged galaxy. It is believed that the galaxy 
partially crosses the caustic, with the central part be- 
ing singly imaged a nd the outskirts being triply imaged. 
iKneib et al.l ()2004aD measured a redshift of 2 = 2.515 for 
the galaxy (labelled as #273 in their notation) but an 
independent spectroscopic redshift has not been deter- 
mined for the faint triply imaged component. Although 
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TABLE 1 
Lensed systems 



System id R.A (J2000) Dec. {J2000) 



rt 



Sl.a 
Sl.b 
Sl.c 
Sl.d 
Sl.e 
Sl.f 
Sl.g 



248.96976 
248.96646 
248.96478 
248.96218 
248.95876 
248.95790 
248.95500 



66.212219 
66.208627 
66.209439 
66.207296 
66.206237 
66.206762 
66.205771 



0.702 
0.702 
0.702 
0.702 
0.702 
0.702 
0.702 



S2.1.a 
S2.1.b 

52.1. C 

52.2. a 
S2.2.b 

52.2. C 

52.3. a 
S2.3.b 

52.3. C 

52.4. a 
S2.4.b 
S2.4.C 



248.95540 
248.95451 
248.93684 
248.95562 
248.95426 
248.93681 
248.95642 
248.95441 
248.93723 
248.95405 
248.95715 
248.93728 



66.218525 
66.218469 
66.206345 
66.218536 
66.218458 
66.206424 
66.218428 
66.218347 
66.206355 
66.218258 
66.218375 
66.206269 



2.515 
2.515 
2.515 
2.515 
2.515 
2.515 
2.515 
2.515 
2.515 
2.515 
2.515 
2.515 



"S3ir 

S3.b 



248.96646 
248.96591 



66.214324 
66.212701 



5.576 
5.576 



S4.1.a 
S4.1.b 

54.1. C 

54.2. a 
S4.2.b 
S4.2.C 



248.97894 
248.97519 
248.96075 
248.97981 
248.97463 
248.96296 



66.210224 
66.206897 
66.201462 
66.210279 
66.206179 
66.201575 



2.515 
2.515 
2.515 
2.515 
2.515 
2.515 



S5.a 
S5.b 
S5.C 



248.981043 
248.977680 
248.974235 



66.205816 
66.203939 
66.202425 



2.515 
2.515 
2.515 



Comments 



(1) 



"W" 



S6.a 
S6.b 


248.9855 
248.98371 


66.200483 
66.199906 


1.034 
1.034 


(4) 


S7.a 
S7.b 
S7.C 


248.94254 
248.94244 
248.95763 


66.216974 
66.216909 
66.220132 


1.03 
1.03 
1.03 


(5) 


S8.a 


248.98805 


66.19633 2.74 




Cl.a 
Cl.b 
Cl.c 


248.97735 
248.97585 
248.95303 


66.210845 
66.208989 
66.200703 


--6.7 
-6.7 
-6.7 


(6) 


C2.a 
C2.b 
C2.C 


248.95398 
248.96247 
248.97692 


66.203803 
66.204748 
66.216365 


2.6 ±0.1 
2.6 ±0.1 
2.6 ±0.1 




C3.a 
C3.b 


248.93879 
248.93749 


66.22238 
66.221662 


2.8 ±0.6 
2.8 ±0.6 




C4.a 
C4.b 


248.97708 
248.97542 


66.210108 
66.208369 


2.2 ±0.2 
2.2 ±0.2 




C5.a 
C5.b 


248.97543 
248.97556 


66.209244 
66.209383 


2.3 ±0.8 
2.3 ±0.8 




C6.a 
C6.b 


248.96481 
248.96505 


66.213039 
66.214017 


2.6 ±0.3 
2.6 ±0.3 





Note. — The multiply imaged systems used to constrain the model. 
The systems and their properties, along with refences to the papers 
reporting their redshifts, are given in Section \3\ Systems C2-C6 are 
new candidate systems reported in this paper. Systems with a spec- 
troscopic redshift measurement are denoted by an S in their id, while 
those without spectroscopic redshifts are denoted by a C. Roman let- 
ters (a,b,c,...) denote the different images of each set of multiple im- 
ages. If several components are identified in system, each component 
is sublabelled, e.g. 2.1 and 2.2 denote two different components of the 
same multiple imaged system. (1) Two pairs of images are caused by 
splitting by a local cluster galaxy. (2) Also known in the literature 
as #384 and #486. (3) The redshift is measured for an associated 
singly imaged galaxy (4) Also known in the literature as #289 (5) 
Also known as #444 and H6. The redshift estimate is based on one 
feature. (6) Redshift estimate based on photometric redshift, lensing 
and a tentative Lya break. 

t Lens model redshift predictions are reported in square brackets. 




Fig. 3. — The new candidate system, C2, identified in the ACS 
images. The three panels show the three images of C2, marked in 
Figure [2] The colors are the same as in Figure [2] The system is 
characterized by a central 'yellow' spot, flanked by blue spots. 




Fig. 4. — The new candidate arcs identified in the ACS images. 
Left panel: The candidate arc C3 (green circles). Middle panel: 
The candidate systems C4 (green circles) and C5 (yellow circle). 
The red circles are the locations of two of the images of system 
CI (at 2 = 6.7), while the blue circle shows the location of images 
belonging to system S4 (components a and b). Right panel: The 
candidate system C6 (green circles). The high redshift system S3 
(z = 5.6) is marked with red circles. 

it is likely (and consistent with the models) that they 
belong to the same background source, it is not certain, 
making the redshift determination for S5 less certain. 

The sixth sy stem, S6, consists of an arc in two parts 
at z = 1.034 (jKneib et all 119961: ISwinbank etal] l2003l 
called 289 in their notation). 

The seventh system, S7, first reported by lKneib et al.l 
(1996), consists of a merging image and a c ounter image 
(also known as #444 and H6 respectively) . lEbbels et al.l 
([1998) measured a spectroscopic redshift at z = 1.03 
for the merging image but this is listed as a tentative 
identification (as the only identified feature in the spectra 
was the Fe II doublet at AA2587, 2600). 

The final previously known multiply imaged system, 
CI, is a hig h redshift (z ~ 7) tr i ply im aged system 
(jKneib et al.| [2004al. Egami et all (|2005l ) further con- 
strained the redshift of the system to lie in the range 
6.6 — 6.8. For the purposes of the modeling we will as- 
sign z — 6.7 to this system, noting that changing the 
redshift within the range 6.6 — 6.8 does not affect the 
model. 

3.2. New candidate systems 

We propose 5 new candidate systems which we have 
identified in the new ACS data. We do not have spec- 
troscopic redshift determination for these systems, but 
estimate their redshifts using the model prediction. 

The first candidate is a triply imaged system, which 
we call C2. In Figure [3] we show color stamps of the 
three images, and the locations are given in Table[TJ The 
morphology of the three images, suggests that it is of 
the same background source, with a 'yellow' spot in the 
middle, fianked by a fainter blue on the sides. 

We also identify a new faint arc in the north-west part 
of the cluster, which we call C3 (see Figure |4|). In the 
vicinity of system 6, we further identify two arclike im- 
ages, which we label C4 and C5. The first, C4, is a very 
faint blue double arc, while in the second, C5, two bright 
spots of a merging arc can be seen. Finally, a pair of 
blue extended images, C6, is found in the vicinity of sys- 
tem 3. These candidate systems are shown in a color 
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Fig. 5. — The alignment of the slit for the spectra of S8. The 
box shows the area used to extract the spectrum (see Figure |6j. 



1300 1400 




Wavelength (A) 

Fig. 6. — The spectrum of system S8. The plot shows lines for the 
arc at 2 = 2.74 (blue) and the nearby galaxy (#617) at z = 0.177 
(red). The spectrum shows Lyo absorption and emission and a 
number of absorption lines in the UV which are marked in the 
figure. At this redshift, z = 2.74, the model predicts S8 to be 
singly imaged. 

image in Figure |4] and are Hsted in Table [1] along with 
their redshift estimates (see Section [53] for details). The 
positions of these images are used to constrain the final 
model, but their redshift is kept free in the modeling. 

3.3. Spectroscopic redshift for an arc 

We have used the Low R esolution Imager and Spec- 
trograph (LRIS. [Oke et al.l (|1995, )) at Keck to measure 
the spectroscopic redshift of an arc, S8, to the south- 
east of the second dark matter clump (see Figure H]). 
Two exposures of 1800 seconds were obtained on June 
29th 2007 with a 175" x 1" long slit placed along the 
brighter components of this arc (Figure[5]). A 600 1 mm~^ 
grism blazed at 4000 A and a 400 1 mm^^ grating blazed 
at 8500 A were used in the blue and red channels of 
the instrument, both lightpaths being separated by a 
dichroic at 5600 A. The corresponding dispersions are 
0.6/1.85 A and resolutions are 4.0/6.5 A in the blue/red 
channel, respectively. 

The resulting spectrum (see Figure IH]) is dominated by 
the light coming from the very bright neighbor cluster 
member at z — 0.177, yet shows Lyman-a in emission 
and UV absorption features of Sill, CI, CII and CIV 
from the arc, giving a redshift z — 2.74. These fea- 
tures are not compatible with any line at the redshift of 
the lens galaxy, giving a redshift class of 2 for this spec- 
trum, with 75% pr obability of be i ng co rrect, following 
the classification of iLe Fevre et al.l (|1995[ ). Our model is 
not consistent with this object being multiply imaged. 



and when including it as a constraint as multiply im- 
aged (using spots in the arclike structure as independent 
images), the model predicts additional counter images 
which are not seen. We therefore conclude that although 
this arc is lensed, it is not multiply imaged, and include 
it as a singly imaged system in the model constraints. 

4. MODELING 

Following e.g . iKneib et al.l (|1996D : lSmith et all (|2005f) : 
iLimousin et"aLl (|2007bD we do a parametric mass recon- 
struction of Abell 2218. The multiply imaged systems 
form the basis of our analysis, with each n-tuply imaged 
system giving 2(n— 1) constraints if the redshift is known. 
As the number of constraints needs to be greater than the 
number of free parameters in our fit, we are limited by 
the known multiply imaged systems. The MCMC sam- 
pling and optimization is done usiri g the Lenstool soft- 
ware ()Kneiblll993t lJullo et al.ll2007t ). The optimization 
is performed in the source plane, as it is faster and has 
been found to be equivalent to optimizing in the image 
plane (Halkola et al. 2006; Julio et al. 2007). The new 
version of Lenstool ijJuUo et al.ll2007f ). gives a distribu- 
tion of values for each of the parameters, thus making it 
possible to estimate the uncertainty of the parameters. 
In addition, it returns the Evidence of a model, which is 
a measure of how likely a model is, penalizing unneces- 
sarily complicated models. Thus, a model with a lower 
but more free parameters, may have a lower Evidence, 
suggesting that we should choose the simpler model. For 
each image we find its rms (root-mean-square) value for 
its position in the source plane, rms^, and image plane, 
rmsi, given by 



1 " 
n ^ ( 



XI ~ < Xi > 



rms,; 



A 



Xo 



(1) 



(2) 



where n is the number of images for the system, Xg is 
the position in the source plane, X the position in the 
image plane and Xohs the observed position in the image 
plane. The overall rms is defined by summing and aver- 
aging over all the images for all the systems. A detailed 
overview of the Lenstool software, incl uding defini t ions o f 
and the Evidence, can be found in I Julio et al.l (|2007f ). 



X 



4.1. Model components 



We refer to the individual components of the model as 
'clumps', where each clump is denoted by its position, 
ellipticity, position angle and the parameters of the pro- 
file used to describe it. The parametric profile we use is 
the dual Pseudo Iso thermal Elliptical mass dis tribution 
(dPIE, derived from lKassiola fc Kovned (|1993lJ )) and its 
form and main properties are given in Appendix]^ The 
dPIE profile is defined in Lenstool by three parameters, 
the core radius a, the scale radius s and a fiducial veloc- 
ity dispersion (TdPiE- For a < r < s, the profile behaves 
as p ~ r~^, while it falls like in the outer regions. 
For a vanishing core radius the scale radius corresponds 
to the radius containing half the 3D mass. 

The clumps are called 'large scale clumps' if their 
mass within the outermost multiply imaged constraint 
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is greater than 20% of the total mass. SmaUer clumps 
are referred to as 'galaxy scale clumps', and are in gen- 
eral associated with the cluster members. Large scale 
dark matter clumps are optimized independently. The 
redshift is fixed at the location of the cluster, but the 
central position (R.A., Dec), the ellipticity and the po- 
sition angle (P.A.) are allowed to vary. 

We associate a galaxy scaled clump with each of the 
cluster galaxies, fixing the central location, ellipticity and 
position angle of the mass distribution to that of the 
light distribution. A few of the cluster galaxies are fitted 
individually (optimizing their a, s and UdPiE, see Table 
[2] for an overview), but due to a lack of constraints, most 
of the galaxies are optimized in a combined way. The 
parameters (a, s, adprn) are optimized together using the 
following scaling relations for the luminosity L: 



L 



CTdPIE - cr^piE I 



1/2 



1/2 



1/4 



(3) 
(4) 
(5) 



For a discussion of th ese scaling relation s we refer to 
iLimousin erall (j2007bD and Halkola ct al] (j2007f ). For a 
given L* luminosity, we fix a* = 0.25 kpc, while cr^pjg 
and s* are allowed to vary. We note that fixing the 
core radius to be small, makes th e profile approxi r nately 
equivalent to the profile used by iBrainerd et al.l ()1996D 
to describe galaxies (s ee also Appendix Following 
Ide Propris et al.l ()1999[ ) we take the apparent magnitude 
of an L* in the K-band to be K* = 15 at z = 0.171 
(redshift of AbeU 2218). 

5. THE STRONG LENSING MASS DISTRIBUTION 

In this section we present our strong lensing model 
(optimized in the source plane), discuss its implications 
and compare it to previous results. All reported error 
bars correspond to 68% confidence levels. For our best 
model we find rms^ = 0'.'12, which gives rmsi = 1'.'49 (see 
Table reftab:chires).* 

5.1. A bimodal mass distribution 

We show the mass density model in Figure [7] and crit- 
ical lines predicted by the model at z = 0.702, 2.515, 6.7 
in Figure [21 The total projected mass as a function of 
radius, centered on the BCG, is shown in Figure [H We 
find that the mass distribution is strongly preferred to 
be bimodal, even for a simple model where all the galax- 
ies are modelled based on the scaling relations, but the 
dark matter clumps are optimized independently using 
only systems 1 and 2 as constraints (28 constraints in 
all). The second dark matter clump is located to the 
south east of the central clump for this simple model, 
near galaxy #617, which is in agreement with previous 

* A parameter file containing all the following information, and 
which can be used with the Lenstool software package, along with 
a FITS file of the mass map generated from the best-fit model are 
available at http://archive.dark-cosmology.dk/ These can be 
used to find relevant critical lines for using AbeU 2218 as a gravita- 
tional telescope and to predict /confirm candidate lensed systems. 



models of Abell 2218. The bimodal mass distribution is 
also strongly preferred when we add more constraints. 
We will refer to the two large scale dark matter clumps 
as DM1 (for the one associated with the BCG) and DM2 
(for the one associated with galaxy #617). We also con- 
structed a three clump model, but its bayesian Evidence 
was worse, leading us to reject it as our best model. 

In previous models, the location of the second clump 
has been fixed at the center of the brightest galaxy in 
the south east corner (#617), but we find that when the 
location is allowed to vary it is offset from this galaxy by 
~ 35". We also find that DM2 has high ellipticity and 
is comparable in mass to DM1, although significantly 
less massive than the DM1 and BCG halos combined 
(see Figure [8]). We note that the light distribution is 
similar to the derived matter distribution (see Figure [7]), 
supporting the finding that there is a significant matter 
component in the vicinity of DM2 (see also section [8]) . 

5.2. Dark matter halos of galaxies 

The potential contribution of the individual dark mat- 
ter halos associated with clust e r gala xies was first pro- 
posed by iNatarajan fc KneTbl (|1997t ). Typically it is 
found that contribution of dark matter halos associated 
with the bright, early- type cluster galaxies in the in- 
ner regions are required to explain the positions and ge- 
ometrie s of multiply lensed ima ges in the strong lensing 
regime ()Meneghetti et al.|[2007l) . 

We fit three galaxies individually (for the parameters 
of these galaxies see Table[2]), two are the brightest galax- 
ies near the centers of the large scale dark matter clumps 
(#1193 - the BCG - and #617), while the third is an el- 
liptical galaxy important to the lensing of system SI. 
The halo associated with the BCG is very massive, and 
in particular in the inner 10" it dominates the mass dis- 
tribution (see Figure [8]). Even at the outermost Einstein 
radii of a multiply imaged system (80") it still contributes 
around 10% of the mass of the cluster. 

System SI has seven images, and two of the pairs are 
strongly affected by nearby galaxies (an elliptical #1028 
and a spiral #993). Keeping the parameters of the spiral 
free did not significantly affect the model, while adding 
the elliptical did. Although both galaxies would at first 
glance appear equally important to be fitted individually 
due to their strong effects on system SI, Figure [9] pro- 
vides an important insight to why only #1028 needs to 
be included individually. This is because the scaling rela- 
tions (see § I4.ip assume that the galaxies follow a Faber- 
Jackson relation (Equation [5]) without an y scatter. As 
can be seen from Figure [9] and discussed inl Ziegler et al.l 
(^OQl?) the galaxies in Abell 2218 show a significant scat- 
ter and therefore the above scaling relations can give 
very unrealistic values for individual galaxies which lie 
far from the mean behavior. The elliptical #1028 is one 
such galaxy, with a measured central velocity dispersion 
which is greater than the central velocity disper sion of 
the BCG as measured bv I J0rgensen et al] ()1999f ). while 
#993, although a spiral galaxy, is more consistent with 
the mean. 

For the other cluster galaxies, they are included in 
the model via the scaling relations given in Section 14.11 
For L* corresponding to if = 15 at z = 0.171 we find 
cr*pjg = 185t\i km s-i and s* = 2'.'9to% but we note 
that there is some degeneracy between the two values 
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TABLE 2 

Model parameters 



Clump 


A R.A. 

// 


A Dec. 
// 


e 




0"dPIE 

(km/s) 


a 

(") 


s 

(") 


DM1 




23 1+1'^ 
^■^■-1-0.7 


n33+'' 03 




8061:^0 


26+1 


274+110 


DM2 


-20 4+"'^ 


no 1 +0.6 
^•^•1-0.6 


39+0.01 1 




1029ti^ 


44+2 


462l«5 


1193 (BCG) 


-0.5 


0.07 


0.46 


52.4 




n 14+0.04 


56+12 


617 


-46.1 


-49.1 


0.20 


59.4 


288tg 


9 c+0.3 
^•^-0.5 


87+10 


1028 


-16.0 


-10.3 


0.18 


80.4 


3qfi+13 
■jy"_l28 


3+°-2 


^■"-0.1 












""dPIE 


a* 


s* 












(km/s) 


(") 


(") 


L* galaxy 










1851^? 


0.09 


2 9+0.5 



Note. — Values quoted without error bars were kept fixed in the optimization. The error 
bars correspond to 68% confidence levels. The location and the ellipticity of the matter clumps 
associated with the cluster galaxies were kept fixed according to the light distribution. The 
center is defined at (R.A., Dec.)=(248.9546, 66.2122) in J2000 coordinates. 

t The posterior probability distribution pushes towards the highest values of our input prior. 
Therefore these error bars do not represent the full uncertainty in the value, but the uncertainty 
given a prior of e < 0.4 




Fig. 7. — The mass density map and its contours (black). The maps are 300" X 300" and centered on the BCG with North being up and 
East being left. The red contours show the light distribution (left panel) and the X-ray distribution (right panel). The overall shape of the 
light-contours and the mass map contours are similar, with the light being slightly more 'pear'-shaped, as it broadens in the SE direction. 
The overall shape of the light-contours and the mass map contours also agree, although the X-rays become more spherical for in the outer 
regions. See section [8] 



(see discussion in section |6|). 

5.3. Comparison with measured velocity dispersions 

We use the velo city dispersion measuremen ts, co, of 
IZiegler et all (|2001[ ) and lJ0rgensen et all (|1999f) for Abell 
2218 to compare w ith the results from the lens model. 
IZiegler et al.l (|2001h have a total of 48 galaxies in their 
sample, of which nearly half fall within the ACS im- 
age and are included i n our galaxy catalogue, while the 
iJeirgensen et al.l (|1999f ) data contains the BCG and seven 
other cluster members in our galaxy catalogue. Us- 
ing the velocity disper sion measurements f or Abell 2218 
cluster members from IZiegler et al.l ()2001| ) we plot the 
Faber-Jackson relationship for the K-band data in Fig- 
ure [HI We then calculate the mean and the standard 
deviation of galaxies with K-band magnitudes in the 
range from 14.8 to 15.2, excluding the individually fit- 
ted galaxy (galaxy # 1028 in our catalog , or #1662 in 
the notation used bv IZiegler et al.l (|2001l )). This gives 



cTg 2iegier ~ ^95 ± 35 km s ^ for a typical K=15 galaxy. 

In Fi gure [TUI w e plot (Tq from IZiegler et al.l ()200lD and 
lJ0rgens en etaL| (|1999| ) vs. the fiducial velocity disper- 
sion, (TdpiE for the BCG, the two individually fitted 
galaxies and L*. Although the values from direct ve- 
locity dispersion measurements can not be directly re- 
lated to the values obtained from the dPIE profile (as 
the measured values are calculated based on an isother- 
mal profile and are not aperture corrected) we find that 
they are consistent with CTdpiE being related to the mea- 
sured v eloci ty dispersion by ao ~ 0.85(TdPiE as found in 
Section lAJl 

5.4. Redshift estimates of the new candidate systems 

We estimated the redshifts of the new candidate sys- 
tems using the model predictions. The three component 
candidate system C2 is found to have z = 2.6 ± 0.1. For 
the merging arcs we find z — 2.8±0.6 for C3, z = 2.2±G.2 
for C4, z = 2.6±0.3 for C6 while C5 is poorly constrained 
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Fig. 8. — Total projected mass as a function of aperture radius 
(centered on the BCG) for different model components. The two 
large scale clumps, DM1 and DM2, contribute a similar amount 
to the mass, but the halo associated with the BCG dominates in 
the inner regions, and the combined halo of DM1 and the BCG 
dominates the mass in the region where most of the constraints lie 
(within 80"). The galaxies (excluding the BCG) only contribute a 
small amount to the overall mass, of the order of 5 — 6%. 
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Fig. 9. — The Fab er-Jackson re l ation for the K-band using 
ao measurements from IZi egler et al.l 1)200 Ih . Also plotted is the 
central velocity dispersion of the BCG (#1193) as obtained by 
IJeirgensen et al] fl999 ). The individually marked galaxies are dis- 
cussed in section [4 . 1 1 and 15.21 




300 400 
CdPiE (km/s) 



Fig. 10. — The velocity dispersion of the dPIE, (TdPiE vs. (tq mea- 
surements from Zicglcr ct al. ( 20011) and [J0rgcnscn ct al. (1999). 
The line is not a fit, but shows the relationship ctq = 0.85(T;jpje 
found in Section lA. 71 



Note. — The ^^^d i'™'^ the source 
plane (where the model is optimised) and the 
image plane. The zero values found for S8 
mean that the system is consistent with being 
singly imaged. 




Fig. 11. — A plot of the source plane at 2; = 2.515 showing 
systems S2, S4 and S5 and the caustic lines. Their maximum 
separation is around 130 kpc, consistent with them belonging to the 
same group of galaxies. Also shown are the candidate systems C2, 
C3, C4, C5 and C6 when their redshift is assumed to be z = 2.515. 
Under that assumption, their locations are consistent with them 
also belonging to this group of galaxies. 

with z = 2.3 ± 0.8. The estimated redshifts are reported 
in Table |TJ These redshifts are consistent with a prelimi- 
nary photometric redshift analysis done using the Hyperz 
photometric redshift code (BolzoncUa ct al. 2000^. 

5.5. A strongly lensed galaxy group at z—2.5 

Three multiply imaged systems, S2, S4 and S5, all have 
the same redshift oi z — 2.515. To check whether these 
three systems belong to a background galaxy group, we 
lens them back to their source plane (see Figure [TT]). 

We find that S4 is in the middle with S5 and S2 at a 
separation of 5'.'4 and llX'S respectively, with the maxi- 
mum separation being 15'.'7, corresponding to 127 kpc in 
the source plane (1" — 8.06 kpc at z = 2.515), suggesting 
that the three systems belong to the same background 
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gr oup of galaxies. Thi s is in agreement with the findings 
of iKneib et all (|2004b[ ) who found the maximum separa- 
tion of the systems in the source plane to be 130 kpc. 
It may be of interest to do a dedicated search for more 
systems at z — 2.515, either multiply imaged or singly 
imaged, to further study this high redshift group of galax- 
ies, and we note that all the candidate systems have a 
predicted z consistent with z — 2.515. If we assume that 
these candidate systems have z = 2.515, their location 
in the source plane is consistent with them belonging to 
this same group (see Figure fTTj) . 

5.6. Comparison with previous results and weak lensing 

Although the overall results of our model are in agree- 
ment with previous models of Abell 2218, they have 
found the second clump to be less mass i ve (see e.g., 
Kneib et al.l 119961: lAbdelsalam et all 119981: ISmith et all 



20051: iNataraian et al.ll2007[ ). There are several possi- 



ble reasons for the discrepancy, with the first one be- 
ing the new constraints we use in this model. We have 
therefore redone the modeling using only the previously 
known spectroscopically confirmed systems, but we still 
find that DM2 is massive and with a large core. It is 
also possible, that the previous models constructed using 
the older version of the Icnstool package which did not 
involve MCMC sampling, have been caught in local min- 
ima. Indeed, we do find when forcing DM2 to be smaller, 
a comparable but with the posterior probability dis- 
tribution of the core radius a pushing toward the upper 
limit of the input range. However, this can not explain 
the discrepancy found f or the non-parametric model of 
lAbdelsalam et all (|1998D . 

A t hird possible exp l anatio n is that th e mod - 
els of Ebdelsakm et J ] (|1998f ): ISmith et all (|2005( ): 
iNatarajan et al.l ( 20071 ) all incorporated weak lensing to 
'anchor' the outer part of the mass distribution. Us- 
ing our model to predict the weak lensing shear profile 
at large radii, we find that our profile ov erestimates the 
signal compared to the measured signal of lBardeau et al.l 
(|2007f ) from ground based observations with the Canada- 
France-Ha waii Telescope ( s ee Fi gure [T2|) . The central 
part of the iBardeau et al.l (|2007t l profile is flat, charac- 
teristic of contamination of the backgro und galaxy cat- 
alog by foreground cluster members (see iLimousin et all 
(|2007br> for discussion on contamination). Therefore, we 
do not expect to have an agreement between the weak 
lensing result and the strong lensing result. At around 
~ 300 — 400" the contamination should be negligible, 
and there we find that the agreement is better although 
our model still overpredicts the signal (although they are 
consistent within 2a) . We stress however that at this ra- 
dius, we are extrapolating a strong lensing model based 
on constraints within 100", and the prediction becomes 
more uncertain the further we go out. 

6. DEGENERACIES 

In Section |5| we presented the strong lensing model of 
Abell 2218. In this Section we study the degeneracies 
of our parametric strong lensing modeling, both those 
inhere nt to the pa rametric profile and the model compo- 
nents. I Julio et all (2007) have also discussed the various 
degeneracies of the dPIE profile in lensing, addressing 
how different image configurations can break some de- 
generacies. 



al] 

if 



R foresee") 

Fig. 12. — The weak lensing signal predicted by our model 

i solid line) compared to the weak lensing found by 'Bardeau et al ' 
20011 ) (dotted Hne - Ict error bars). The Bardeau ct al. (200^ 
shows a very flat inner profile, characteristic of contamination of 
the background galaxy catalog by cluster members. Therefore, 
we do not expect an agreement in the inner regions. In the outer 
regions, where the contamination should be negligible, we find that 
our model overpredicts the weak lensing signal, but we note that 
the prediction is an extrapolation of a strong lensing model with 
constraints within 100". 
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Fig. 13. — The degeneracy between the scale radius, s, and ctjjpjb 
for #102 8 (lef t) and the scaled galaxies (right). These arise from 
equation I AlOl which gives the aperture mass, showing that to keep 
the enclosed mass constant, an increase in the scale radius requires 
a lower value of CTjipje. The contours correspond to Icr, 2a and Scr 
confidence levels. 

Lensing most strongly constrains the projected mass, 
and therefore we expect to see degeneracies arising from 
Equation lAlOl although ellipticity m ay complica t e that 
picture further. In agreement with I Julio et al] (|2007l ) 
we find for the large scale halos, that the scale ra- 
dius, s, is poorly constrained (as it lies beyond the 
outermost multiply imaged system). This is also the 
case for the BCG and the #617 which have large scale 
radii. For the smaller halos, i.e., the scaled galax- 
ies and #1029, the scale radius, s, is small enough to 
affect the projected mass, and we find that lower s 
requires higher CdPiE to keep the mass constant (see 
Figure [T3|). However, the favored region for s is al- 
ways small, consistent with the tidal st r ipping of clus ter 
galaxies proposed by Nataraian et al.l (|1998l [2b02al |bh: 
ILimousin et all (j2007al |cD. ." ^ 

Also, in agre ement withljullo et al.l (j2007f ). and as dis- 
cussed by e.g. lKochanei3 (|1996l ) for more general cored 
profiles, a larger core radius, a, requires higher (TdpiE to 
keep the mass constant (see Figure [T4)) . As our model has 
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Fig. 14. — The de gener acy between the core radius, a, and (TdpiE for DM1, DM2, BCG, #1028, #617 from left to right. These again 
arise from Equation lAlOl requiring the aperture mass to remain constant. They show that t o keep the mass constant, the ctjjpie must 
in crease when the c ore radius a is increased. This is consistent with the findings of lJuUo et al.l IpOO?) for the dPIE profile and the findings 
of lKochaneg (11993 ) for more general profiles with core. The contours correspond to Ifj, 2a and 3cr confidence levels. 
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Fig. 15. — The 2D posterior distribution of the parameters of 
DM2 vs. DM1 and BCG. There is a clear degeneracy between 
the parameters of the two large scale components and the BCG. 
Therefore, although the model prefers the inclusion of all the com- 
ponents, the values of their parameters are not fully independent. 
The contours correspond to la, 2a and 3cr confidence levels. 

both a large core radius a and (TdPiE for DM2 compared 
to previous models of A2218, we have explored whether 
this degeneracy can reduce both values. However, when 
forcing both a and CTdpiE to be smaller, the posterior dis- 
tribution of a always pushes to the upper limit of the 
input range. Therefore, we conclude that this degener- 
acy is not the explanation for the high a and ddpiE we 
find for DM2. This flat profile is further supported by 
the 'blind tests' we perform in Section FTSl 

In addition to the degeneracies associated with the pro- 
file itself, there may be degeneracies associated with the 
components we include in the model, i.e., are all the com- 
ponents we include necessary and are they independent 
of each other? As mentioned in section 15.11 the model 
is strongly preferred to be bimodal. There is however 
degeneracy between the parameters of DM1 (and the 
BCG) and DM2 which is visualized in Figure [T5l As for 
the BCG, we find that the Evidence is marginally higher 
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Fig. 16. — The 2D posterior distribution of the scaled galaxy 
parameters (a* , s*) with respect to the core radii of the large scale 
dark matter clumps DM1 and DM2. Although there is a degener- 
acy between the parameters, the scale radius s* is well constrained 
at low values. The contours correspond to la, 2a and 3a confidence 
levels. 

when both the DM1 and BCG are included in the model, 
suggesting that the data are sufhcient to model both sep- 
arately. Therefore we conclude, that although the model 
prefers the inclusion of all three components, the values 
for their parameters are not fully independent. Finally, 
we look at the parameters for the scaled galaxies with re- 
spect to DM1 and DM2 (see Figure [IB)) . Although there 
is degeneracy the preferred value, in particular for the 
scale radius s*, is well constrained at low values (< 4"), 
consistent with the tidal stripping scenario. 

7. RELIABILITY OF THE MASS MAP 

We have presented the mass map inferred from our 
strong lensing analysis in Section [5] (Figure [7]). In short, 
we find evidence for a bimodal mass distribution de- 
scribed by two large scale smooth dark matter clumps, 
on top of which we add some extra mass associated with 
the cluster galaxies. Moreover, we associate a significant 
mass concentration with the location of the BCG galaxy. 
These two conclusions are believed to be robust, and the 
aim of this section is to perform a series of tests in or- 
der to check our main findings regarding how the dark 
matter is distributed in Abell 2218. 

7.1. Do we need the smoothly distributed dark matter 
component? 

We wish to test whether the smoothly distributed com- 
ponent is necessary to reproduce a good fit, or whether 
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Fig. 17. — Left panel: The mass map from Section |7. II using only halos associated with the galaxies. Right panel: The mass map from 
Section [5] consisting of large scale dark matter halos and galaxy halos. The mass map including the large scale dark matter halos, which 
is much smoother, provides a significantly better fit to the data than the clumpy galaxy-only model (rmss = 0'.'12 vs. rmss = 1'.'62). The 
maps are 300" X 300" and are centered on the BCG with North being up and East being left. 
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Fig. 18. — The total mass as function of radius (centered on 
the BCG) for the two models. Although the total mass within 
any given radii is similar, the pure-galaxy-mass map gives a signif- 
icantly worse fit to the data (rmss = 0.12 vs rmss = 1.62), con- 
firming the need for a large scale dark matter halo to accurately 
fit the data (see Section |7.1|I . 

mass associated with the galaxies alone can provide an 
equally good fit. To this end, we construct a model where 
the mass is only in halos associated with the galaxies, 
without the inclusion of any smoothly distributed dark 
matter component. The galaxies are included in a scaled 
manner, allowing the ct^pie ^^'^ ^* move to higher val- 
ues to increase their mass. As a result of the optimiza- 
tion, we find a very poor fit (rmSs = 1'.'62 instead of 0'.'12 
for the model from Section [5]). As the model from Sec- 
tion [5] individually fits three of the galaxies, we need to 
check whether this difference arises from these extra free 
parameters. We therefore redo the analysis using only 
DM1 and DM2, and scaling all the cluster galaxies. The 
resulting fit is worse than the original (rms^ = 0'.'22) but 
still significantly better than the fit without any smooth 
component. 

The resulting mass map is shown in Figure [17] along- 
side our model from Section [5l The mass map without a 
smooth component is very 'clumpy', whereas the latter 
is smooth. As expected, the enclosed mass derived from 



each model is comparable for any radius where we have 
observational constraints (see Figure fT8|) . Therefore, the 
poorness of the 'clumpy' fit is not due to the fact that 
it is not massive enough to reproduce the lensing con- 
straints. Moreover, it is worth noting that the 'clumpy' 
model is not very satisfactory in the sense that it de- 
scribes the cluster galaxies as being very massive (around 
a few times IO^^Mq on average), which is not compatible 
with independent galaxy-galaxy lensing probes of cluster 
galaxy masses (see e.g., iMandclbaum ct al. 2005). We 
therefore interpret the difference in the goodness of fit as 
evidence for the dark matter being distributed smoothly 
in the cluster, with only small perturbations from the 
cluster galaxies. This is further supported by the X-ray 
emission (see Section [S]). 

7.2. Sensitivity to the galaxy scale perturbers 

We have already seen that individual galaxies are im- 
portant to the overall lens model if they are close to a 
multiply imaged system (see also Section 17. 3p . but in 
general the cluster galaxies only add small perturbations 
to the overall mass distribution of the cluster, with ~ 5- 
6% of the total mass being associated with the galaxy 
sized halos (excluding the BCG, see Figure [5]). To check 
whether those small scale perturbations are important, 
we try using a subset of the galaxy catalog to see if this 
affects the results. Our original catalog contains 197 
cluster members down to K = 19.6 (including the in- 
dividually fitted galaxies), but we also create catalogs 
using a cut off magnitude of K = 19, 18, 17, 16 with 
145,110,62,35 members respectively. We find that the 
overall quality of the fit is only weakly affected (see Ta- 
bled]), suggesting that for the purposes of lensing studies, 
it is sufficient to include the brightest galaxy members in 
the modeling in addition to those which clearly locally 
perturb given multiply imaged systems. This point is 
important in order to save computing time, as increased 
number of clumps, even if modelled by scaling, signifi- 
cantly increases the required CPU (Central Processing 
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TABLE 4 

Goodness of fit for different cut off magnitudes 



Cut off magnitude in K # of galaxy members log E rmss {") rms; (") 



19.6 197 -205 0.12 1.49 

19.0 145 -192 0.12 1.05 

18.0 110 -238 0.14 1.77 

17.0 62 -284 0.17 1.87 

16.0 35 -216 0.12 1.18 



Note. — The x and Evidence (E) for different cutoff magnitude in the galaxy 
catalogue. All the models are optimised in the same way, the only difference being 
the nu mber of galaxy members which are optimised using the scaling relations in 
section 14.11 



Unit) time. 

7.3. Blind tests - can we localize galaxy scale 
subtructure? 

Normally galaxy sized perturbers are added to the 
model by placing a dark matter clump at the location 
of a known cluster member. This method is by construc- 
tion not able to detect dark substructure directly. To 
test how sensitive we are to these substructures, lumi- 
nous or dark, we perform 'blind tests', where we model 
the cluster with (A) five and (B) ten freely placed galaxy 
sized dark matter clumps, in addition to the two large 
scale dark matter clumps DM1 and DM2. 

In order to limit the number of free parameters, we fix 
both a = 1 kpc and s = 40 kpc for the galaxy sized dark 
matter clumps, but we optimize all the parameters of 
the large scale clumps as before. We allow the positions 
of the galaxy scale clumps to vary from —100" to 100", 
corresponding roughly to the ACS field of view. The 
velocity dispersion, (TdPiE, of each clump is allowed to 
vary from 0-500 km s^^. 

The results are shown in Figure 1191 showing a den- 
sity plot of the position of the clumps for the 1000 best 
realizations (in the lowest sense), for both configu- 
rations (A) and (B). The locations of the clumps cor- 
responding to the best realization are marked by stars. 
We find that both configurations give a good fit to the 
data (rmSs = 0.17 and rmSs = 0.19 for (A) and (B) 
respectively). Configurations (A) and (B) are consis- 
tent with each other: setup (A) finds five well defined 
clumps. Setup (B) finds the same well defined five 
clumps, whereas it tries to marginalize the five extra 
clumps that seem not to be needed in the optimization. 

For both configurations, the model places two large ha- 
los where the BCG is, clearly demonstrating the need for 
significant mass at the location of the BCG galaxy. An- 
other common feature, is the lack of a galaxy sized clump 
in the vicinity of DM2, suggesting a very flat profile is 
preferred there. Both configurations place clumps in the 
positions of galaxies #1028 and #993, which are respon- 
sible for the local splitting of SI. This shows that lensing 
analysis can reliably detect galaxy sized dark substruc- 
ture if it causes the local splitting of images, although 
the lensing map is not very sensitive to galaxy sized sub- 
structure in general (see also Section [7?2]) . 

8. BIMODALITY OF THE MASS DISTRIBUTION: 
EVIDENCE OF A MERGER 

Our strong lensing analysis shows that Abell 2218 has 
a bimodal mass distribution: even if the dominant mass 



component is the BCG and DM1 halos, the second large 
scale dark matter clump DM2 is also significant, con- 
tributing around 20% of the total mass within 100". This 
second smooth mass component is associated with the 
bright cluster galaxy in the south-east that we call #617. 
To further interpret this bimodality, we look at two dif- 
ferent available probes of the cluster: the X-ray map and 
the velocity of cluster members. 

The X-ray map The X-ray fiux map in the cen- 
tral parts of the cluster shows a complex 
morphology with no outstanding central peak. 
The offset between the X-ray peak at (R.A., 
Dec.)=(248.967, 66.210) and the BCG is signifi- 
cant, ~ 20", with the X-ray peak located in the 
direction of DM2, but there is no evident peak 
in the X-ray emission in the vicinity of DM2. 
The X-ray flux map within about 1 arcmin of the 
peak of the X-ray emission is clearly elongated in 
the SE-NW direction, but becomes more spheri- 
cal with increasing distance from the cluster cen- 
ter (see Figure [T]). Comparing the contours for 
the X-rays and the mass map we see that the 
elongation of the two are very similar, although 
the X-rays become more spherical at large radii. 
To get more quantitative values for comparison of 
the two maps, we fitted a 2-dime nsional (3 model 
(jCavaliere fc Fusco-Femian"olll978[ ) to both maps. 

For the X-ray map, the best fit beta-model to the 
inner 2 arcmin by 2 arcmin (centered on the X-ray 

centroid) has an eccentricity {e^ = -^/l — (S/A)^) 

of e^(inner) = 0.27 ± 0.12 and a position angle 
of ^?x(h^ner) = 39° ± 16° (measured anti-clockwise 
from West). Fitting to the the 4 arcmin by 4 ar- 
cmin X-ray map (centered on the X-ray centroid) 
results in eccentricity of e^(outer) — 0.18 ± 0.08 
and a position angle of 0x (outer) = 17° ± 14°. A 
similar analysis for the mass map in a 5 arcmin 
by 5 arcmin (centered on the BCG) gives an ec- 
centricity of the overall mass map as e^^j^ss ~ 0.25 
with ^mass ~ 39°, consistent with that found for 
the X-ray map, in particular in the inner regions. 

Distribution in velocity space If a merger has taken 
place in Abell 2218, then this should be imprinted 
in the velocity distribution of the cluster members: 
we should be able to identify two structures in ve- 
locity space; one associated with the BCG galaxy, 
and one associated with galaxy #617. 
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Fig. 19. — Density plots for the positions of the galaxy sized clumps for the models discussed in section[7]3] The locations of the clumps 
for the best model are marked with red stars. The size of the stars is proportional to their velocity dispersion. The two big black stars 
show the two large dark matter clumps, their size again proportional to the velocity dispersion. Left panel: Configuration (A): The model 
consists of five freely placed galaxy sized halos in addition to DM1 and DM2. Right panel: Configuration (B) The model consists of ten 
freely placed galaxy sized halos in addition to DM1 and DM2. Both configurations place galaxy scaled clumps where the BCG and the 
galaxies responsible for the local splitting of SI are located. The maps are 200" X 200" and are centered on the BCG with North being up 
and East being left. 

model is based on 7 multiply imaged systems and 1 arc 
with spectroscopic redshifts, and 6 systems without spec- 
troscopic redshifts, of which 5 are new candidate systems 
proposed in this work. The model is sampled and op- 
timized in the source plane by a Bayesian Monte Carlo 
Markov Chain implemented in the publicly available soft- 
ware Lenstool. Our best model has rmSs = 0'.'12 in the 
source plane, corresponding to rms^ = 1'.'49 in the image 
plane. 

We have found, in agreement with previous models of 
Abell 2218, that the mass distribution is bimodal. We 
find that DM2 is larger and with a flatter core than 
previous models have found. The flatness of the pro- 
file near DM2 is further supported by our 'blind tests', 
which do not place a galaxy sized component near its 
center. The BCG and DM1 are the dominant compo- 
nent of the mass model in the inner regions (< 100") of 
the cluster. We have analyzed the distribution of galaxies 
in velocity space, finding evidence for two substructures, 
separated by 1000 km s~^, corresponding to DM1 and 
DM2. Although both the light and the X-ray contours 
are consistent with the mass map, the center of the X-ray 
emission is offset from the central peak of the BCG. We 
find that the X-ray data and the distribution in velocity 
space support the interpretation that the bimodal mass 
distribution arises from a cluster merger. 

We have explored the degeneracy of the mass model, 
both those inherent to the dPIE profile and those aris- 
ing from the mass model components themselves. For 
the dPIE, our re sults are in agreement with those of 
lJullo et all ()2007[ ). We find that the large scale dark 
matter clumps are a necessary component of the model, 
i.e., using only the dark matter halos associated with the 
galaxies does not give a good fit to the data (rmSs — 1'.'62 
vs. imSs = 0'.'12), even if they are allowed to become 
much more massive. 

At around 100" the two large scale halos contribute 
~ 85% of the enclosed projected mass, while the BCG 
contributes ~ 9% and the remaining cluster galaxies 
^ 6%. We have performed 'blind tests' to check where 
the model requires a galaxy scale component to repro- 
duce the lensing constraints, and find that both the BCG 



iGirardi et al.l ()1997[ ) studied the structure of 
Abell 2218 using the spectroscopic data from 
iLe Borgne et al.l (|1992f ). They found evidence for 
two structures (labelled MSI and MS2) separated 
by 2000 km s~^. The larger of these two struc- 
tures, MS2, contains both the BCG and the bright- 
est galaxy in DM2. The two structures are super- 
imposed along the line of sight and do not corre- 
spond exactly to the clumps found for the strong 
lens modeling. Such superimposed structures are 
hard to separate using lensing as it is not sensitive 
to the 3D distribution of the matter. 

To look for substructure associated with the BCG 
and #617 we have calculated the separation {Av = 
c{zi — Zj)/{1 -t- Zi)) between them and the other 
cluster members using spectroscopic data from 
iLe Borgne et al.l ()1992l ). Out of the 50 galaxies in 
the catalog we found 13 with Aw < 500 km s~^ 
and 25 with Av < 1000 km s^^ for the brightest 
galaxy in DM2 while for DM1 the numbers were 
11 and 21 respectively. The Aw < 500 km s~^ cut 
defines two structures without any common mem- 
bers, while the Av < 1000 km s~^ cut starts mixing 
the two groups. While all the galaxies we find with 
a Aw < 500 km s^^ for t he BCG and #617 w ould 
belong to MS2 found bv IGirardi et all (|1997t ). the 
data suggests that MS2 may be further subdivided 
into two smaller structures, corresponding to the 
DM1 and DM2 found by the lens models. 

The X-ray data and the distribution of the cluster 
members in velocity space, indicates that the bimodal 
mass distribution is caused by a merger. The main clus- 
ter is the one associated with the BCG (DM1 and BCG 
in the lensing analysis) which the second cluster associ- 
ated with #617 (DM2 in the lensing analysis) has merged 
with, thus displacing the center of the X-ray peak from 
the BCG galaxy. 

9. CONCLUSIONS 

We have reconstructed a mass map of the rich galaxy 
cluster Abell 2218 using strong lensing constraints. Our 
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and galaxies which locally perturb given multiply imaged 
systems are reliably reclaimed. However, we find that 
the inclusion of the cluster galaxies (excluding the BCG) 
only weakly affects the model unless they locally perturb 
a multiply imaged system. Assuming that mass scales 
with light, this shows that strong lensing constraints can 
reliable detect substructure, dark or luminous, if the sub- 
structure is massive or locally perturbs a system. 

The accurate mass map we have presented is made 
available to the community and can be used to exploit 
Abell 2218 as a gravitational telescope, probing the high 
redshift universe. In this work we have fixed the cos- 
mology, but with the increased number of constraints it 
may be possible to simultaneously fit it with the mass 
distribution. However, the bimodal structure of Abell 
2218, and the remaining uncertainty in the mass model 
(around ^ 1" in the image plane) may make it challeng- 
ing to reach competitive accuracy in the derived cosmo- 
logical parameters. 
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A. THE DUAL PSEUDO ISOTHERMAL ELLIPTICAL MASS DISTRIBUTION 

The Pseudo Isothermal Elhptical Mass Distribution (PIEMD) was fi rst introduced by iKassiola &: Kovneii (fl"993af) 
and it and its var i ants are frequently used in lensing analysis (see e.g.. iKneib et al.lll996t iKeeton fc KochaneklllQQSl : 
ISmith et all 120051 : iLimousin etaH l2007bl : iNatarajan et all l2007t iRichard et al.ll2007l ). The main advantage of us- 
ing the PIEMD for lensing analysis is that its potential, as well as its first and second partial derivatives, can be 
expressed a nalytically to derive deflections, distortions, amplifications and time delays for any ellipticity (see also 
iKeeton fc~ kochanck (1998)). One of the most commonly used variants, and the one used in this work, is a two 
component PIEMD with both a core radius and a scale radius. This profile is sometimes referred to as the "trunctu- 
ated PIEMD" or simply th e " PIEMD" , although it differs in significant ways from the original PI EMD proposed by 
IKassiola fc Kovnerl (|1993a[ ). An equivalent family of mass models was introduced independently bv lMerritt fc Valluril 
( 1991 " who used it as a starting point in an investigation of the orbital properties of triaxial stellar systems. In this 
appendix we give a self-contained description of this profile, which we will call the 'dual Pseudo Isothermal Elliptical 
mass distribution' (dPIE) and present its derived quantities most relevant to lensing. 



A.l. The 3D density profile 

The 3D density distribution of the dPIE is 

Pif) = T-, o , „ , ; s> a. (Al) 

' (1 + r2/a2)(l + r2/s2) ^ ' 

This distribution represents a spherical system with scale radius s, core radius a and central density po ^- In the center 
p w po/(l + r"^ /d^) which describes a core with central density po- The core is not strictly isothermal (in which case 
p ~ (1 -I- /a?)~^/'^ (i Binnev fc Tremaine ''l987')) but in the transition region, a < r < s, we have p ^ as for the 
isothermal sphere. In the outer parts the density falls off as p ~ r""*. 
The density distribution can be rewritten as 



with ^ = r/s and 77 = s/a. Thus s acts as a scale parameter and 77 is a shape parameter. 

A. 2. The 2D density profile 
The projected density, which is the relevant quantity for lensing, is given by 

00 



with 



E(i?) = 2/ -^M^d.^Eo^f^^-^^) (A3) 

So-TTpo— — . A4 
s + a 



and R is the 2D radius. For a vanishing core radius it reduces to 

which is the surface mass profile proposed bv lBrainerd et all (|1996f) for modeling galaxy-galaxy lensing. In this special 
case, a becomes a simple amplitude scaling parameter and, as shown below, s is the half-mass radius, r^. 
Defining the critical surface mass density 

where D^, Ds, and D^s a-re the angular diameter distances to the lens, the source, and between the lens and source 
respectively, the convergence is 

<R) - 1^ (A7) 

■^crit 



and the corresponding shear is 
So as 



7(i?) 



Scrit S — a 



(A8) 



® We note that s corresponds to rcut and a to rcoro in previous publications (e.g. ILimousin et al.l Il2007bl) : IJuUo et aLI II2007I )). but we 
adopt this new n otati on to avoid confusion, as s is not a 'cut-off' nor a trunctuation radius, but the a scale radius, with p ~ for r >> s 
(see also Section lA.3l for how the mass depends on s). 
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A. 3. Mass relations 

The mass inside physical radius r is 



M3D{r) = 4n p{f)f^df = 47rpo 2^2 (■sarctan ( - j — aarctan ( - ) ) (A9) 



and the mass inside projected radius R is 



M2d{R) =2tt Y,{R)RdR = 2nT,o-^ (\/a^ + R^ -a- \/s^ + R^ + s] . (AlO) 
Jo s — a \ J 



The dPIE has a finite total mass, given by 



Mtot = 27r^Po = 27rSoas. (All) 

a + s 



The half-mass radius defined by M^uiTh) = Mtot/2 obeys the relation 

sarctan ) — aarctan ) ~ 4 ~ (-^-'-2) 

For a<^Th-, s 

rft ~ s + |a (A13) 
i.e., rh> s with equality for a ^ 0. Similarly, the efii'ective radius defined by M2D{Re) = Mtot/2 is 



R^ = ^^a^ + }^as + s^ (AM) 



which for 77 ^ <C 1 (a <C s) reduces to 
i.e., i?e « (3/4)r,, for a ^ 0. 

A. 4. The potential 

The gravitational potential ^ = — $ is 



3 5 

Re^-s+-a, (A15) 



*(r) = -^Ar = 4.Gpo^ (- arctan (-) - - arctan (-) + - In j j (A16) 



2 2 
as , s 



and the central potential is 

*(0) = 47rGpo „," In 7 (A17) 

which is finite for finite a. 
The projected potential is given by 



^{R) = 2G [ 
Jr 



M2niR)^^ 



R 

= 47rGEo^^ + i?2 - + i?2 + aln(a + Va^T^) - sln(s + ^s^T^)) 

+ constant, (A18) 
where Rl is a limiting radius leading to the constant term. The deflection then is 

2DLsd^ SttGDls^ as 



where 



fiR/a, R/s) = { 1 . (A20) 
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A. 5. EUipticity 

An elliptical projected mass distribution with ellipticity e = {A — B)/{A + B), where A and B are the semi major 
and minor axes respectively, can be introduced by substituting R ^ R, with 

where X and Y are the spatial coordinates along the major and minor projected axes, respectively. With this definition 
of the ellipticity, the quan tity e= 1 — i3/^is related to e through e = 2e/(l + e). For further details we refer to 
iKassiola &: Kovrieil (|1993a[ ). and note that in their notation the model discussed in this paper corresponds to 

•^-.3/2 = ;3^(«"'4*,l/2 - '^-'lll/2)- (A22) 

and the analytical potential is given by solving 

SX^~ SX ' W^-^ SY ^ gx5Y " ^ 5X ~ 5Y ' 

A. 6. The dPIE in Lenstool 

The dPIE is incorporated into the Lenstool software using the full elliptical expressions. The profile is given by 
specifying 8 parameters, its redshift z, its central position (R.A.,Dec.), its ellipticity and orientation {e,9i), the core 
radius a, the scale radius s and a fiducial velocity dispersion UdPiE- This fiducial velocity dispersion is related to the 
deflection, a, by 

a = ^^^Eof{R/a, R/s) (A23) 

where 

iJo=67r^%^. (A24) 
Ds d- 

In terms of Sq, a and s (or po, a and s) (TdpiE is given by 

4 ObS^ 4 

The ellipticity is defined by 

^=A^ (A26) 

corresponding to 



e^l-^ = l-^. (A27) 

A 

Its direction, ^j, is measured anti-clockwise from the West and is related to the position angle (P.A.) by P.A.— — 90°. 

A. 7. Velocity dispersions 

Although UdPiE is a fiducial velocity dispersion, we wish to relate it to the measured velocity dispersion of galaxies, 
assuming that their profile is described by a dPIE. In this derivation we will look at the most simple case, assuming 
no anisotropy and a spherically symmetric dPIE profile. Even under these assumptions, the central velocity dispersion 
and the velocity dispersi on profiles for the dPIE ca nnot be expressed analytically and must be computed numerically. 
The velocity dispersion (jBinnev fc Tremainelll987[ l is 

and the projected (line-of-sight) velocity dispersion is 

2.m_ 2G r MMr)p{r 



"^(^) = ^A '^ v/^^d.. (A29) 

We can also compute the average line-of-sight velocity dispersion inside a given radius R as 

^"^^ AiMR) ■ ^ ' 

This is the quantity measured by velocity dispersion measurements, and for galaxies the relevant radius corresponds 
roughly to the slit width used for the spectrograph. For the galaxies in our model (see Table [5]) and a spectrograph 
resolution of ^ 1", this corresponds to an observable region of 0.01 < (R/Re) < 2. In Figure Al we plot the average 
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Fig. 20. — The squared measured velocity dispersion ((fp)) vs. the squared fiducial fjpjB- The ratio varies depending on both 
the observed radius R, and the ratio of a and s. For values similar to those found for the galaxies in the model (Table [2]l we find 



line-of-sight velocity dispersion over the fiducial velocity dispersion UdpiE for different values of rj. We find that for 
a/s values similar to those found in the modelling, \/ (cp) /(TdPiE ~ 0.85 ± 0.10 in the observable region. 

Finally, to facilitate comparison with velocity dispersion profiles from different profiles, we define a 'standard' 
velocity dispersion erg. This velocity dispersion is motivated by the velocity dispersion of the isothermal profile (for 
which = 2GM2d{R)/R for all R), calculated at the effective radius Re'- 



which goes to 2 when the core radius a goes to zero. For a/ s values similar to those found in the modelling Ce/CTdpiE ~ 





(A31) 



(A32) 



f^dPiE sv/«^ + (10/3)as + s2 



1.3 ±0.2. 
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